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(1) GENERAL INFORMATION: 

(i) APPLICANT: Scott, Matthew P. 

Goodrich, Lisa V. 
Johnson, Ronald L. 

(ii) TITLE OF INVENTION: Mammalian Patched Gene and Its Use 

(iii) NUMBER OF SEQUENCES: 2 0 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Flehr, Hohbach, Test, Albritton & Herbert 

(B) STREET: 4 Embarcadero Center, Suite 3 4 00 

(C) CITY: San Francisco 

(D) STATE: California 

(E) COUNTRY: USA 

(F) ZIP: 94111-4187 

(V) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1,25 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: Rowland, Bertram I 

(B) REGISTRATION NUMBER: 20,015 

(C) REFERENCE/ DOCKET NUMBER: A60190/BIR STAN171 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (415) 781-1989 

(B) TELEFAX: (415) 398-3249 

(C) TELEX: 910277299 




(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 680 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 
AACCTATGGC ACCCCCCCCA ACCTTTCCTA ACAAAACCCC TTTATACCCC CTTAAATTTT 
CCACCCAAAC CCTGAACAGA AACCTTTTTA ACCCCCCCCA CCCGGAATTC CATCCCCCCC 



60 



120 




AAATTACAAC 


TCCAGCCAAA 


ATTAAAATTG 


GTCCTAACCT 


AACCATGTTG 


TTACGGTTTC 


180 


CCCCCCCAAA 


TACATGCACT 


GGCCCGAACA 


CTTGATCGTT 


GCCGTTCCAA 


TAAGAATAAA 


240 


TCTGGTCATA 


TTAAACAAGC 


CAAAGCTTTA 


CAAACTGTTG 


TACAATTAAT 


GGGCGAACAC 


300 


GAACTGTTCG 


AATTCTGGTC 


TGGACATTAC 


AAAGTGCACC 


ACATCGGATG 


GAACCAGGAG 


360 


.AAGGCCACAA 


CCGTACTGAA 


CGCCTGGCAG 


AAGAAGTTCG 


CACAGGTTGG 


TGGTTGGCGC 


420 


AAGGAGTAGA 


GTGAATGGTG 


GTAATTTTTG 


GTTGTTCCAG 


GAGGTGGATC 


GTCTGACGAA 


480 


GAGCAAGAAG 


TCGTCGAATT 


ACATCTTCGT 


GACGTTCTCC 


ACCGCCAATT 


TGAACAAGAT 


540 


GTTGAAGGAG 


GCGTCGAAAC 


GGACGTGGTG 


AAGCTGGGGG 


TGGTGCTGGG 


GGTGGCGGCG 


600 


GTGTACGGGT 


GGGTGGCCCA 


GTCGGGGCTG 


GCTGCCTTGG 


GAGTGCTGGT 


CTTGCGGCTC 


660 


ATTCGCCCTA 


TAGTAGCGTA 










680 



(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 107 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Xaa Pro Pro Pro Asn Tyr Asn Ser Xaa Pro Lys Xaa Xaa Xaa Leu Val 
15 10 15 

Leu Thr Pro Xaa Val Val Thr Val Ser Pro Pro Lys Tyr Met His Trp 
20 25 30 

Pro Glu His Leu lie Val Ala Val Pro lie Arg lie Asn Leu Val lie 
35 40 45 

Leu Asn Lys Pro Lys Ala Leu Gin Thr Val Val Gin Leu Met Gly Glu 
50 55 60 

His Glu Leu Phe Glu Phe Trp Ser Gly His Tyr Lys Val His His lie 
65 70 75 80 

Gly Trp Asn Gin Glu Lys Ala Thr Thr Val Leu Asn Ala Trp Gin Lys 

85 90 95 

Lys Phe Ala Gin Val Gly Gly Trp Arg Lys Glu 
100 105 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 



2 



(A) LENGTH: 4448 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 

GGCCAGTGTC GACACGCTGT TAAGGTGTTA CGAAACTATC GAAACTCTGT ACCGTTCCGT 60 

GTTACATTCG GTCAGTGATA AACAGAAGTG AATAATTCGG GTGTACAAGT GTGTGGTTAT 12 0 

GTGGCGGCGA GTGATATGAC AGCGTGCGCC GCCAGACGGA TTCCAGCCGT CTTCATGAGG 18 0 

ATATCGGTGA CAAGCCTGGC CCCGAGATCA TGGTGGCTCC CGATTCCGAG GCTCCTTCGA 2 40 

ATCCTCGGAT AACGGCTGCA CACGAGAGCC CCTGCGCCAC CGAGGCGCGC CACAGTGCTG 3 00 

ATCTTTACAT ACGTACCAGT TGGGTGGACG CCGCACTAGC TCTCTCTGAA CTCGAAAAGG 3 60 

GTAACATCGA AGGAGGAAGA ACCTCTCTGT GGATACGAGC GTGGCTACAA GAACAGCTCT 42 0 

TTATTTTGGG CTGCTTTCTT CAAGGCGACG CGGGGAAAGT CCTCTTCGTT GCCATCCTCG 4 80 

TTCTGTCGAC GTTCTGCGTC GGTCTCAAGT CAGCACAAAT ACATACAAGG GTCGACCAAC 54 0 

TCTGGGTTCA AGAGGGTGGT AGATTAGAAG CCGAGTTGAA ATATACAGCG CAAGCTTTGG 6 00 

GCGAGGCGGA CTCCTCGACG CACCAGCTTG TCATACAAAC TGCCAAAGAT CCAGACGTCT 6 60 

CCCTGCTACA TCCAGGCGCG TTGCTTGAAC ACCTTAAGGT GGTGCACGCA GCGACTCGGG 72 0 

TGACAGTTCA CATGTACGAC ATTGAGTGGC GCCTCAAAGA CCTGTGCTAC AGCCCCAGCA 780 

TACCGGACTT CGAGGGTTAC CACCACATCG AGTCAATCAT AGACAACGTC ATCCCCTGCG 84 0 

CTATTATCAC CCCCCTTGAT TGCTTCTGGG AAGGCTCCAA GTTGCTTGGT C C CG ATT AT C 9 00 

CTATATACGT ACCACATCTT AAACACAAAC TACAATGGAC ACATTTAAAT CCATTGGAAG 9 60 

TTGTAGAAGA AGTGAAAAAA TTAAAGTTCC AATTTCCTCT GAGCACAATA GAGGCGTACA 102 0 

TGAAGAGAGC CGGCATCACT TCCGCCTACA TGAAAAAGCC GTGCTTAGAC CCCACCGACC 108 0 

CACATTGTCC AGCCACGGCT CCAAACAAAA AGTCTGGTCA TATTCCAGAT GTAGCGGCGG 114 0 

AGCTGTCGCA CGGATGTTAT GGTTTCGCGG CAGCTTACAT GCACTGGCCG GAACAGTTAA 12 00 

TTGTAGGGGG AGCTACAAGG AATTCGACAT CAGCTCTGAG AAAAGCACGC GTTTACAGAC 12 6 0 

TGTAGTACAG TTAATGGGCG AGAGAGAAAT GTACGAGTAC TGGGCCGATC ATTATAAAGT 13 2 0 

ACATCAAATT GGCTGGAATC AAGAGAAGGC AGCTGCTGTA CTGGATGCCT GGCAGAGAAA 13 8 0 

GTTTGCCGCT GAAGTCAGAA AAATTACTAC CTCAGGATCA GTATCATCGG CTTATAGTTT 14 4 0 



CTATCCGTTC TCCACCTCGA 
GAAGAACATT ATATTAGGCT 
ATGGCGGGAT CCGATTCGCT 
AATCACTGTT GCCGCTGGCT 
AAGTACGCAA ATAGTACCAT 
CACTCACACG TATGTGGAGC 
ATTGAAAAAG AGCGGTTTAA 
GGCAGCAGCC CTTCTACCTA 
TCTTCTGTTT AACTTGGGGT 
GCGACGGAGG TCAGCCGCGA 
ACCGAAGAAG AAAATTCCGG 
AGACACCACG AGACAACCTC 
CTTAAGCGTC TCGCTCACCA 
CGCTGTTAAG GTTACATCCA 
AGCGACAAAA GTAAAGGATG 
ACACGAATTT TTATCTCGTC 
GCAAGGCAAC TTTGAATATC 
CGTCAGAATA CCTAATATAA 
TTTATTCCGC GACTGGTTAT 
TTGTATAACA CAAGAGTATT 
ACTTATGGTG CAGACTGGCC 
TCACAGACTA GTTGACAAAG 
AGCTTGGGCT ACTAACGACG 
GCCCCAAAGA TGGATCCATT 
ATTAATTTAC ACACAGTTAC 
ACGTTGATAA GATCTGTGCG 
TTTCCATCGG GTATACCATT 
CTACTGGCTT TGGCGTGTGC 
AACGCCTGGG CAGCAGTACT 



CACTTAATGA CATACTCGGG 

ATATGTTTAT GTTAATTTAT 

CGCAAGCGGG TGTGGGTATC 

TAGGATTTTG TGCTTTATTA 

TCCTAGCGCT CGGGTTAGGA 

AAGCGGGAGA TGTGCCTAGA 

GCGTACTTCT GGCGTCTTTG 

TTCCAGCTTT CAGAGTATTT 

CAATATTACT GGTTTTTCCT 

GGGCCGATCT TTTATGCTGT 

AAAGAGCAAA AACTAGAAAA 

TAGACCCAGA TGTGTCCGAG 

AGTGGGCCAA GAACCAATAC 

TGTTAGCGTT GATTGCTGTT 

GATTGGATTT GACTGATATT 

AGGAAAAATA CTTTGGCTTC 

CCACCAATCA GAAGTTATTA 

TCAAGAATGA TAACGGCGGT 

TGGACTTGCA AGTGGCTTTT 

GGTGCAAAAA CGCGAGTGAC 

ATGTGGACAA TCCAATCGAT 

ACGGTATTAT AAATCCAAAG 

CGTTGGCATA CGGAGCCTCA 

CTCCGGAGGA TGTACATTTA 

CATTCTACCT TTCCGGTCTC 

AGATTTATGT CTGAAGTACG 

CCTTTTCTGG GAACAGTATT 

TTTGGCAGCT GTCTTCATTG 

GGTGACATTA GCGCTGGCTA 



AAGTTCTCCG AAGTGTCACT 

GTTGCCGTTA CTTTAATACA 

GCCGGAGTTC TACTACTATC 

GGCATACCAT TCAACGCTTC 

GTTCAAGATA TGTTTCTTCT 

GAAGAGAGGA CTGGACTTGT 

TGCAACGTGA TGGCATTTTT 

TGCCTACAGG CTGCCATACT 

GCTATGATCT CGTTAGACCT 

TTGATGCCTG AGAGTCCATT 

AACGATAAGA CT CAT AG GAT 

AACGTGACCA AAACTTGCTG 

GCGCCGTTCA TCATGCGCCC 

ATTCTGACTA GCGTTTGGGG 

GTACCGGAGA ATACAGACGA 

TATAATATGT ACGCCGTGAC 

TAT GAG TAT C ACGATCAATT 

CTCACGAAAT TTTGGTTGAG 

GATAAGGAGG TTGCCAGCGG 

GAAGGAATAT TGGCCTATAA 

AAGTCTCTGA TTACGGCAGG 

GCATTTTATA ATTACCTATC 

CAAGGGAACT TGAAACCTCA 

GAAATAAAGA AATCGTCGCC 

AGCGACACTA TAG CATC AAA 

AGGCGAAAGG TTTACCGAAC 

TGTATTTAAG GACATCTTTA 

CGGTTATGGT GCTATTGTTG 

CATTAGTACT GCAGCTGTTA 



GGTGTATGGC CTTATTGGGC GTGAAG CT AT CTGCAATGCC GGCCGTACTA CTGGTGCTAG 3 24 0 

CCATTGGGAG AGGAGTTCAC TTCACTGTGC ATTTATGTTT GGGTTTTGTA ACATCAATCG 3 3 00 

GTTGCAAGCG GCGCCGCGCG TCACTAGCTC TAGAATCAGT TCTGGCGCCA GTGGTGCACG 3 3 60 

GCGCTCTGGC GGCCGCGCTG GCTGCCTCGA TGCTAGCTGC AAGTGAATGT GGCTTCGTTG 3 42 0 

-CCAGACTGTT CTTGAGGTTA CTACTGGACA TCGTGTTTCT GGGACTCATC GATGGGTTGC 348 0 

TGTTCTTCCC TATTGTCCTT TCGATATTGG GACCGGCTGC TGAGGTACGA CCTATAGAGC 3 54 0 

ATCCAGAACG CTTATCGACT CCATCGCCAA AATGTTCGCC CATCCACCCT CGCAAATCAA 3 600 

GTTCCAGCTC AGGCGGTGGT GATAAATCAA GTCGAACCAG TAAATCAGCA CCAAGGCCTT 3 660 

GCGCACCATC TCTCACGACC ATTACTGAAG AGCCTTCGAG TTGGCACAGT TCCGCCCACT 372 0 

CCGTACAATC TTCTATGCAG TCGATAGTGG TCCAGCCGGA GGTGGTGGTC GAAACTACCA 3 78 0 

CGTATAATGG CAGCGATTCT GCTTCAGGAC GGTCGACGCC TACAAAGTCT TCACACGGTG 384 0 

GTGCTATCAC AACTACTAAG GTGACCGCCA CGGCAAATAT AAAGGTAGAA GTGGTGACAC 3 9 00 

CGAGTGACAG GAAATCGCGA CGTTCCTATC ATTACTATGA TCGTCGAAGG GATCGCGATG 3 960 

AAGATAGGGA TCGAGACCGT GAAAGGGACA GAGATCGCGA CAGGGATCGG GATAGGGATC 4 02 0 

GTGACCGGGA CAGGGATAGG GATAGAGAAC GATCGAGAGA ACGAGACAGG CGAGACCGAT 4 080 

ATAGAGACGA AAGGGACCAC CGAGCTTCGC CGAGAGAAAA ACGGCAGAGA TTCTGGACAT 414 0 

GAAAGTGATT CATCACGTCA TTAAGGATGA AGTACATTGC CAACTTGCCA GAAACGCTCG 42 0 0 

TACAGTCGCG GTATCTGAAC ATATACCGGT TGCGTTATTG AATAACTTGT AAATGATCTC 4 2 60 

TTTGATTCGG CTTAAACATA TTTTAGTTCA CATCCTGCAA CATCTTAATA CGGTTAGGGT 43 2 0 

TAATTTTAAG TCCATAATAG TCGTATGTGG TTGGTCCTTG CGTTAAGTGC ATTAGTTTTT 4 380 

TCCCATCACT GGGTCCGTGG CTTTTAATTT GGTTTAACGG GGGGGGACCT TTTTTTTTTT 44 4 0 

TTTTTTTG 44 4 8 
(2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1311 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 
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Met Val Ala Pro Asp Ser Glu Ala Pro Ser Asn Pro Arg lie Thr Ala 
1 5 10 15 

Ala His Glu Ser Pro Cys Ala Thr Glu Ala Arg His Ser Ala Asp Leu 
20 25 30 

Tyr lie Arg Thr Ser Trp Val Asp Ala Ala Leu Ala Leu Ser Glu Leu 
35 40 45 

Glu Lys Gly Asn lie Glu Gly Gly Arg Thr Ser Leu Trp lie Arg Ala 
50 55 60 

Trp Leu Gin Glu Gin Leu Phe lie Leu Gly Cys Phe Leu Gin Gly Asp 
65 70 75 80 

Ala Gly Lys Val Leu Phe Val Ala lie Leu Val Leu Ser Thr Phe Cys 

85 90 95 

Val Gly Leu Lys Ser Ala Gin lie His Thr Arg Val Asp Gin Leu Trp 
100 105 110 

Val Gin Glu Gly Gly Arg Leu Glu Ala Glu Leu Lys Tyr Thr Ala Gin 
115 120 125 

Ala Leu Gly Glu Ala Asp Ser Ser Thr His Gin Leu Val lie Gin Thr 
130 135 140 

Ala Lys Asp Pro Asp Val Ser Leu Leu His Pro Gly Ala Leu Leu Glu 
145 150 155 160 

His Leu Lys Val Val His Ala Ala Thr Arg Val Thr Val His Met Tyr 

165 170 175 

Asp lie Glu Trp Arg Leu Lys Asp Leu Cys Tyr Ser Pro Ser lie Pro 
180 185 190 

Asp Phe Glu Gly Tyr His His lie Glu Ser lie lie Asp Asn Val lie 
195 200 205 

Pro Cys Ala lie lie Thr Pro Leu Asp Cys Phe Trp Glu Gly Ser Lys 
210 215 220 

Leu Leu Gly Pro Asp Tyr Pro lie Tyr Val Pro His Leu Lys His Lys 
225 230 235 240 

Leu Gin Trp Thr His Leu Asn Pro Leu Glu Val Val Glu Glu Val Lys 

245 250 255 

Lys Leu Lys Phe Gin Phe Pro Leu Ser Thr lie Glu Ala Tyr Met Lys 
260 265 270 

Arg Ala Gly lie Thr Ser Ala Tyr Met Lys Lys Pro Cys Leu Asp Pro 
275 280 285 

Thr Asp Pro His Cys Pro Ala Thr Ala Pro Asn Lys Lys Ser Gly His 
290 295 300 

lie Pro Asp Val Ala Ala Glu Leu Ser His Gly Cys Tyr Gly Phe Ala 
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305 310 315 320 

Ala Ala Tyr Met His Trp Pro Glu Gin Leu lie Val Gly Gly Ala Thr 

325 330 335 

Arg Asn Ser Thr Ser Ala Leu Arg Lys Ala Arg Xaa Leu Gin Thr Val 
340 345 350 

Val Gin Leu Met Gly Glu Arg Glu Met Tyr Glu Tyr Trp Ala Asp His 
355 360 365 

Tyr Lys Val His Gin lie Gly Trp Asn Gin Glu Lys Ala Ala Ala Val 
370 375 380 

Leu Asp Ala Trp Gin Arg Lys Phe Ala Ala Glu Val Arg Lys lie Thr 
385 390 395 400 

Thr Ser Gly Ser Val Ser Ser Ala Tyr Ser Phe Tyr Pro Phe Ser Thr 

405 410 415 

Ser Thr Leu Asn Asp lie Leu Gly Lys Phe Ser Glu Val Ser Leu Lys 
420 425 430 

Asn lie lie Leu Gly Tyr Met Phe Met Leu lie Tyr Val Ala Val Thr 
435 440 445 

Leu lie Gin Trp Arg Asp Pro lie Arg Ser Gin Ala Gly Val Gly lie 
450 455 460 

Ala Gly Val Leu Leu Leu Ser lie Thr Val Ala Ala Gly Leu Gly Phe 
465 470 475 480 

Cys Ala Leu Leu Gly lie Pro Phe Asn Ala Ser Ser Thr Gin lie Val 

485 490 495 

Pro Phe Leu Ala Leu Gly Leu Gly Val Gin Asp Met Phe Leu Leu Thr 
500 505 510 

His Thr Tyr Val Glu Gin Ala Gly Asp Val Pro Arg Glu Glu Arg Thr 
515 520 525 

Gly Leu Val Leu Lys Lys Ser Gly Leu Ser Val Leu Leu Ala Ser Leu 
530 535 540 

Cys Asn Val Met Ala Phe Leu Ala Ala Ala Leu Leu Pro lie Pro Ala 
545 550 555 560 

Phe Arg Val Phe Cys Leu Gin Ala Ala lie Leu Leu Leu Phe Asn Leu 

565 570 ' 575 

Gly Ser lie Leu Leu Val Phe Pro Ala Met lie Ser Leu Asp Leu Arg 
580 585 590 

Arg Arg Ser Ala Ala Arg Ala Asp Leu Leu Cys Cys Leu Met Pro Glu 
595 600 605 

Ser Pro Leu Pro Lys Lys Lys lie Pro Glu Arg Ala Lys Thr Arg Lys 
610 615 620 



Asn Asp Lys Thr His Arg lie Asp Thr Thr Arg Gin Pro Leu Asp Pro 
625 630 635 640 

Asp Val Ser Glu Asn Val Thr Lys Thr Cys Cys Leu Ser Val Ser Leu 

645 650 655 

Thr Lys Trp Ala Lys Asn Gin Tyr Ala Pro Phe lie Met Arg Pro Ala 
660 665 670 

Val Lys Val Thr Ser Met Leu Ala Leu lie Ala Val lie Leu Thr Ser 
675 680 685 

Val Trp Gly Ala Thr Lys Val Lys Asp Gly Leu Asp Leu Thr Asp lie 
690 695 700 

Val Pro Glu Asn Thr Asp Glu His Glu Phe Leu Ser Arg Gin Glu Lys 
705 710 715 720 

Tyr Phe Gly Phe Tyr Asn Met Tyr Ala Val Thr Gin Gly Asn Phe Glu 

725 730 735 

Tyr Pro Thr Asn Gin Lys Leu Leu Tyr Glu Tyr His Asp Gin Phe Val 
740 745 750 

Arg lie Pro Asn lie lie Lys Asn Asp Asn Gly Gly Leu Thr Lys Phe 
755 760 765 

Trp Leu Ser Leu Phe Arg Asp Trp Leu Leu Asp Leu Gin Val Ala Phe 
770 775 780 

Asp Lys Glu Val Ala Ser Gly Cys lie Thr Gin Glu Tyr Trp Cys Lys 
785 790 795 800 

Asn Ala Ser Asp Glu Gly lie Leu Ala Tyr Lys Leu Met Val Gin Thr 

805 810 815 

Gly His Val Asp Asn Pro lie Asp Lys Ser Leu lie Thr Ala Gly His 
820 825 830 

Arg Leu Val Asp Lys Asp Gly lie lie Asn Pro Lys Ala Phe Tyr Asn 
835 840 845 

Tyr Leu Ser Ala Trp Ala Thr Asn Asp Ala Leu Ala Tyr Gly Ala Ser 
850 855 860 

Gin Gly Asn Leu Lys Pro Gin Pro Gin Arg Trp lie His Ser Pro Glu 
865 870 875 880 

Asp Val His Leu Glu lie Lys Lys Ser Ser Pro Leu lie Tyr Thr Gin 

885 890 895 

Leu Pro Phe Tyr Leu Ser Gly Leu Ser Asp Thr Xaa Ser lie Lys Thr 
900 905 910 

Leu lie Arg Ser Val Arg Asp Leu Cys Leu Lys Tyr Glu Ala Lys Gly 
915 920 925 

Leu Pro Asn Phe Pro Ser Gly lie Pro Phe Leu Phe Trp Glu Gin Tyr 



930 935 940 

Leu Tyr Leu Arg Thr Ser Leu Leu Leu Ala Leu Ala Cys Ala Leu Ala 
945 950 955 960 

Ala Val Phe lie Ala Val Met Val Leu Leu Leu Asn Ala Trp Ala Ala 

965 970 975 

Val Leu Val Thr Leu Ala Leu Ala Thr Leu Val Leu Gin Leu Leu Gly 
980 985 990 

Val Met Ala Leu Leu Gly Val Lys Leu Ser Ala Met Pro Ala Val Leu 
995 1000 1005 

Leu Val Leu Ala lie Gly Arg Gly Val His Phe Thr Val His Leu Cys 
1010 1015 1020 

Leu Gly Phe Val Thr Ser lie Gly Cys Lys Arg Arg Arg Ala Ser Leu 
1025 1030 1035 1040 

Ala Leu Glu Ser Val Leu Ala Pro Val Val His Gly Ala Leu Ala Ala 

1045 1050 1055 

Ala Leu Ala Ala Ser Met Leu Ala Ala Ser Glu Cys Gly Phe Val Ala 
1060 1065 1070 

Arg Leu Phe Leu Arg Leu Leu Leu Asp lie Val Phe Leu Gly Leu lie 
1075 1080 1085 

Asp Gly Leu Leu Phe Phe Pro lie Val Leu Ser lie Leu Gly Pro Ala 
1090 1095 1100 

Ala Glu Val Arg Pro lie Glu His Pro Glu Arg Leu Ser Thr Pro Ser 
1105 1110 1115 1120 

Pro Lys Cys Ser Pro lie His Pro Arg Lys Ser Ser Ser Ser Ser Gly 

1125 1130 1135 

Gly Gly Asp Lys Ser Ser Arg Thr Ser Lys Ser Ala Pro Arg Pro Cys 
1140 1145 1150 

Ala Pro Ser Leu Thr Thr lie Thr Glu Glu Pro Ser Ser Trp His Ser 
1155 1160 1165 

Ser Ala His Ser Val Gin Ser Ser Met Gin Ser lie Val Val Gin Pro 
1170 1175 1180 

Glu Val Val Val Glu Thr Thr Thr Tyr Asn Gly Ser Asp Ser Ala Ser 
1185 1190 1195 1200 

Gly Arg Ser Thr Pro Thr Lys Ser Ser His Gly Gly Ala lie Thr Thr 

1205 1210 1215 

Thr Lys Val Thr Ala Thr Ala Asn lie Lys Val Glu Val Val Thr Pro 
1220 1225 1230 

Ser Asp Arg Lys Ser Arg Arg Ser Tyr His Tyr Tyr Asp Arg Arg Arg 
1235 1240 1245 
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Asp Arg Asp Glu Asp Arg Asp Arg Asp Arg Glu Arg Asp Arg Asp Arg 
1250 1255 1260 

Asp Arg Asp Arg Asp Arg Asp Arg Asp Arg Asp Arg Asp Arg Asp Arg 
1265 1270 1275 1280 

Glu Arg Ser Arg Glu Arg Asp Arg Arg Asp Arg Tyr Arg Asp Glu Arg 

1285 1290 1295 

Asp His Arg Ala Ser Pro Arg Glu Lys Arg Gin Arg Phe Trp Thr 
1300 1305 1310 

(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4434 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 



CGAAACAAGA 


GAGCGAGTGA 


GAGTAGGGAG 


AGCGTCTGTG 


TTGTGTGTTG 


AGTGTCGCCC 


60 


ACGCACACAG 


GCGCAAAACA 


GTGCACACAG 


ACGCCCGCTG 


GGCAAGAGAG 


AGTGAGAGAG 


120 


AG AAAC AG CG 


GCGCGCGCTC 


GCCTAATGAA 


GTTGTTGGCC 


TGGCTGGCGT 


GCCGCATCCA 


180 


CGAGATACAG 


ATACATCTCT 


CATGGACCGC 


GACAGCCTCC 


CACGCGTTCC 


GGACACACAC 


240 


GGCGATGTGG 


TCGATGAGAA 


ATTATTCTCG 


GATCTTTACA 


TACGCACCAG 


CTGGGTGGAC 


300 


GCCCAAGTGG 


CGCTCGATCA 


GATAGATAAG 


GGCAAAGCGC 


GTGGCAGCCG 


CACGGCGATC 


360 


TATCTGCGAT 


CAGTATTCCA 


GTCCCACCTC 


GAAACCCTCG 


GCAGCTCCGT 


GCAAAAGCAC 


420 


GCGGGCAAGG 


TGCTATTCGT 


GGCTATCCTG 


GTGCTGAGCA 


CCTTCTGCGT 


CGGCCTGAAG 


480 


AGCGCCCAGA 


TCCACTCCAA 


GGTGCACCAG 


CTGTGGATCC 


AGGAGGGCGG 


CCGGCTGGAG 


540 


GCGGAACTGG 


CCTACACACA 


GAAGACGATC 


GGCGAGGACG 


AGTCGGCCAC 


GCATCAGCTG 


600 


CTCATTCAGA 


CGACCCACGA 


CCCGAACGCC 


TCCGTCCTGC 


ATCCGCAGGC 


GCTGCTTGCC 


660 


CACCTGGAGG 


TCCTGGTCAA 


GGCCACCGCC 


GTCAAGGTGC 


ACCTCTACGA 


CACCGAATGG 


720 


GGGCTGCGCG 


ACATGTGCAA 


CATGCCGAGC 


ACGCCCTCCT 


T CG AG G G CAT 


CTACTACATC 


780 


GAGCAGATCC 


TGCGCCACCT 


CATTCCGTGC 


TCGATCATCA 


CGCCGCTGGA 


CTGTTTCTGG 


840 


GAGGGAAGCC 


AGCTGTTGGG 


TCCGGAATCA 


GCGGTCGTTA 


TACCAGGCCT 


CAACCAACGA 


900 


CTCCTGTGGA 


CCACCCTGAA 


TCCCGCCTCT 


GTGATGCAGT 


ATATGAAACA 


AAAGATGTCC 


960 
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GAGGAAAAGA TCAGCTTCGA CTTCGAGACC GTGGAGCAGT ACATGAAGCG TGCGGCCATT 102 0 

GGCAGTGGCT ACATGGAGAA GCCCTGCCTG AACCCACTGA ATCCCAATTG CCCGGACACG 108 0 

GCACCGAACA AGAACAGCAC CCAGCCGCCG GATGTGGGAG CCATCCTGTC CGGAGGCTGC 114 0 

TACGGTTATG CCGCGAAGCA CATGCACTGG CCGGAGGAGC TGATTGTGGG CGGACGGAAG 12 00 

AGGAACCGCA GCGGACACTT GAGGAAGGCC CAGGCCCTGC AGTCGGTGGT GCAGCTGATG 12 6 0 

ACCGAGAAGG AAATGTACGA CCAGTGGCAG GACAACTACA AGGTGCACCA TCTTGGATGG 13 2 0 

ACGCAGGAGA AGGCAGCGGA GGTTTTGAAC GCCTGGCAGC GCAACTTTTC GCGGGAGGTG 13 8 0 

GAACAGCTGC TACGTAAACA GTCGAGAATT GCCACCAACT ACGATATCTA CGTGTTCAGC 144 0 

TCGGCTGCAC TGGATGACAT CCTGGCCAAG TTCTCCCATC CCAGCGCCTT GTCCATTGTC 150 0 

ATCGGCGTGG CCGTCACCGT TTTGTATGCC TTTTGCACGC TCCTCCGCTG GAGGGACCCC 156 0 

GTCCGTGGCC AGAGCAGTGT GGGCGTGGCC GGAGTTCTGC TCATGTGCTT CAGTACCGCC 162 0 

GCCGGATTGG GATTGT C AG C CCTGCTCGGT ATCGTTTTCA ATGCGCTGAC CGCTGCCTAT 168 0 

G C GG AG AG C A ATCGGCGGGA GCAGACCAAG CTGATTCTCA AGAACGCCAG CACCCAGGTG 174 0 

GTTCCGTTTT TGGCCCTTGG TCTGGGCGTC GATCACATCT TCATAGTGGG ACCGAGCATC 18 0 0 

CTGTTCAGTG CCTGCAGCAC CGCAGGATCC TTCTTTGCGG CCGCCTTTAT TCCGGTGCCG 186 0 

GCTTTGAAGG TATTCTGTCT GCAGGCTGCC ATCGTAATGT GCTCCAATTT GGCAGCGGCT 192 0 

CTATTGGTTT TTCCGGCCAT GATTTCGTTG GATCTACGGA GACGTACCGC CGGCAGGGCG 198 0 

GACATCTTCT GCTGCTGTTT TCCGGTGTGG AAGGAACAGC CGAAGGTGGC ACCTCCGGTG 2 04 0 

CTGCCGCTGA ACAACAACAA CGGGCGCGGG GCCCGGCATC CGAAGAGCTG CAACAACAAC 2100 

AGGGTGCCGC TGCCCGCCCA GAATCCTCTG CTGGAACAGA GGGCAGACAT CCCTGGGAGC 2160 

AGTCACTCAC TGGCGTCCTT CTCCCTGGCA ACCTTCGCCT TTCAGCACTA CACTCCCTTC 22 2 0 

CTCATGCGCA GCTGGGTGAA GTTCCTGACC GTTATGGGTT TCCTGGCGGC CCTCATATCC 2 2 80 

AG CTTGTATG CCTCCACGCG CCTTCAGGAT GGCCTGGACA TTATTGATCT GGTGCCCAAG 2 34 0 

GACAGCAACG AGCACAAGTT CCTGGATGCT CAAACTCGGC TCTTTGGCTT CTACAGCATG 2400 

TATGCGGTTA CCCAGGGCAA CTTTGAATAT CCCACCCAGC AGCAGTTGCT CAGGGACTAC 24 60 

CATGATTCCT TTGTGCGGGT GCCACATGTG ATCAAGAATG ATAACGGTGG ACTGCCGGAC 2 52 0 

TTCTGGCTGC TGCTCTTCAG CGAGTGGCTG GGTAATCTGC AAAAGATATT CGACGAGGAA 2 58 0 

TACCGCGACG GACGGCTGAC CAAGGAGTGC TGGTTCCCAA ACGCCAGCAG CGATGCCATC 2 64 0 

CTGGCCTACA AGCTAATCGT GCAAACCGGC CATGTGGACA ACCCCGTGGA CAAGGAACTG 2 7 00 
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# 



GTGCTCACCA ATCGCCTGGT CAACAGCGAT GGCATCATCA ACCAACGCGC CTTCTACAAC 2 760 

TATCTGTCGG CATGGGCCAC CAACGACGTC TTCGCCTACG GAGCTTCTCA GGGCAAATTG 282 0 

TATCCGGAAC CGCGCCAGTA TTTTCACCAA CCCAACGAGT ACGATCTTAA GATACCCAAG 2 88 0 

AGTCTGCCAT TGGTCTACGC TCAGATGCCC TTTTACCTCC ACGGACTAAC AGATACCTCG 294 0 

CAGATCAAGA CCCTGATAGG TCATATTCGC GACCTGAGCG TCAAGTACGA GGGCTTCGGC 3 000 

CTGCCCAACT ATCCATCGGG CATTCCCTTC ATCTTCTGGG AGCAGTACAT GACCCTGCGC 3 060 

TCCTCACTGG CCATGATCCT GGCCTGCGTG CTACTCGCCG CCCTGGTGCT GGTCTCCCTG 312 0 

CTCCTGCTCT CCGTTTGGGC CGCCGTTCTC GTGATCCTCA GCGTTCTGGC CTCGCTGGCC 318 0 

CAGATCTTTG GGGCCATGAC TCTGCTGGGC ATCAAACTCT CGGCCATTCC GGCAGTCATA 3 24 0 

CTCATCCTCA GCGTGGGCAT GATGCTGTGC TTCAATGTGC TGATATCACT GGGCTTCATG 3 3 00 

ACATCCGTTG GCAACCGACA GCGCCGCGTC C AG CTGAG C A TGCAGATGTC CCTGGGACCA 3 3 60 

CTTGTCCACG GCATGCTGAC CTCCGGAGTG GCCGTGTTCA TGCTCTCCAC GTCGCCCTTT 3 42 0 

GAGTTTGTGA TCCGGCACTT CTGCTGGCTT CTGCTGGTGG TCTTATGCGT TGGCGCCTGC 3 480 

AACAGCCTTT TGGTGTTCCC CATCCTACTG AGCATGGTGG GACCGGAGGC GGAGCTGGTG 3 54 0 

CCGCTGGAGC ATCCAGACCG CATATCCACG CCCTCTCCGC TGCCCGTGCG CAGCAGCAAG 3 600 

AGATCGGGCA AATCCTATGT GGTGCAGGGA TCGCGATCCT CGCGAGGCAG CTGCCAGAAG 3 660 

TCGCATCACC ACCACCACAA AGACCTTAAT GATCCATCGC TGACGACGAT CACCGAGGAG 3720 

CCGCAGTCGT GGAAGTCCAG CAACTCGTCC ATCCAGATGC CCAATGATTG GACCTACCAG 3 780 

CCGCGGGAAC AGCGACCCGC CTCCTACGCG GCCCCGCCCC CCGCCTATCA CAAGGCCGCC 3 840 

GCCCAGCAGC ACCACCAGCA TCAGGGCCCG CCCACAACGC CCCCGCCTCC CTTCCCGACG 3 9 00 

GCCTATCCGC CGGAGCTGCA GAGCATCGTG GTGCAGCCGG AGGTGACGGT GGAGACGACG 3 9 60 

CACTCGGACA GCAACACCAC CAAGGTGACG GCCACGGCCA ACATCAAGGT GGAGCTGGCC 4 02 0 

ATGCCCGGCA GGGCGGTGCG CAGCTATAAC TTTACGAGTT AGCACTAGCA CTAGTTCCTG 4 08 0 

TAGCTATTAG GACGTATCTT TAGACTCTAG CCTAAGCCGT AACCCTATTT GTATCTGTAA 414 0 

AATCGATTTG TCCAGCGGGT CTGCTGAGGA TTTCGTTCTC ATGGATTCTC ATGGATTCTC 42 00 

ATGGATGCTT AAATGGCATG GTAATTGGCA AAATATCAAT TTTTGTGTCT CAAAAAGATG 42 6 0 

CATTAGCTTA TGGTTTCAAG ATACATTTTT AAAGAGTCCG CCAGATATTT ATATAAAAAA 432 0 

AATCCAAAAT CGACGTATCC ATGAAAATTG AAAAGCTAAG CAGACCCGTA TGTATGTATA 4 3 80 

TGTGTATGCA TGTTAGTTAA TTTCCCGAAG TCCGGTATTT AT AG C AG CTG CCTT 44 3 4 
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(2) INFORMATION FOR SEQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 128 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Met Asp Arg Asp Ser Leu Pro Arg Val Pro Asp Thr His Gly Asp Val 
15 10 15 

Val Asp Glu Lys Leu Phe Ser Asp Leu Tyr lie Arg Thr Ser Trp Val 
20 25 30 

Asp Ala Gin Val Ala Leu Asp Gin lie Asp Lys Gly Lys Ala Arg Gly 
35 40 45 

Ser Arg Thr Ala lie Tyr Leu Arg Ser Val Phe Gin Ser His Leu Glu 
50 55 60 

Thr Leu Gly Ser Ser Val Gin Lys His Ala Gly Lys Val Leu^Phe Val 
65 70 75 80 

Ala lie Leu Val Leu Ser Thr Phe Cys Val Gly Leu Lys Ser Ala Gin 

85 90 95 

lie His Ser Lys Val His Gin Leu Trp lie Gin Glu Gly Gly Arg Leu 
100 105 110 

Glu Ala Glu Leu Ala Tyr Thr Gin Lys Thr lie Gly Glu Asp Glu Ser 
115 120 125 

Ala Thr His Gin Leu Leu lie Gin Thr Thr His Asp Pro Asn Ala Ser 
130 135 140 

Val Leu His Pro Gin Ala Leu Leu Ala His Leu Glu Val Leu Val Lys 
145 150 155 160 

Ala Thr Ala Val Lys Val His Leu Tyr Asp Thr Glu Trp Gly Leu Arg 

165 170 175 

Asp Met Cys Asn Met Pro Ser Thr Pro Ser Phe Glu Gly lie Tyr Tyr 
180 185 190 

lie Glu Gin lie Leu Arg His Leu lie Pro Cys Ser lie lie Thr Pro 
195 200 205 

Leu Asp Cys Phe Trp Glu Gly Ser Gin Leu Leu Gly Pro Glu Ser Ala 
210 ' 215 220 

Val Val lie Pro Gly Leu Asn Gin Arg Leu Leu Trp Thr Thr Leu Asn 
225 230 235 240 
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Pro Ala Ser Val Met Gin Tyr Met Lys Gin Lys Met Ser Glu Glu Lys 

245 250 255 

lie Ser Phe Asp Phe Glu Thr Val Glu Gin Tyr Met Lys Arg Ala Ala 
260 265 270 

lie Gly Ser Gly Tyr Met Glu Lys Pro Cys Leu Asn Pro Leu Asn Pro 
275 280 285 

Asn Cys Pro Asp Thr Ala Pro Asn Lys Asn Ser Thr Gin Pro Pro Asp 
290 295 300 

Val Gly Ala lie Leu Ser Gly Gly Cys Tyr Gly Tyr Ala Ala Lys His 
305 310 315 320 

Met His Trp Pro Glu Glu Leu lie Val Gly Gly Arg Lys Arg Asn Arg 

325 330 335 

Ser Gly His Leu Arg Lys Ala Gin Ala Leu Gin Ser Val Val Gin Leu 
340 345 350 

Met Thr Glu Lys Glu Met Tyr Asp Gin Trp Gin Asp Asn Tyr Lys Val 

355 360 365 

His His Leu Gly Trp Thr Gin Glu Lys Ala Ala Glu Val Leu Asn Ala 
370 375 380 

Trp Gin Arg Asn Phe Ser Arg Glu Val Glu Gin Leu Leu Arg Lys Gin 
385 390 . 395 400 

Ser Arg lie Ala Thr Asn Tyr Asp lie Tyr Val Phe Ser Ser Ala Ala 

405 410 415 

Leu Asp Asp lie Leu Ala Lys Phe Ser His Pro Ser Ala Leu Ser lie 
420 425 430 

Val lie Gly Val Ala Val Thr Val Leu Tyr Ala Phe Cys Thr Leu Leu 
435 440 445 

Arg Trp Arg Asp Pro Val Arg Gly Gin Ser Ser Val Gly Val Ala Gly 
450 455 460 

Val Leu Leu Met Cys Phe Ser Thr Ala Ala Gly Leu Gly Leu Ser Ala 
465 470 475 480 

Leu Leu Gly lie Val Phe Asn Ala Leu Thr Ala Ala Tyr Ala Glu Ser 

485 490 495 

Asn Arg Arg Glu Gin Thr Lys Leu lie Leu Lys Asn Ala Ser Thr Gin 
500 505 510 

Val Val Pro Phe Leu Ala Leu Gly Leu Gly Val Asp His lie Phe lie 
. 515 520 525 

Val Gly Pro Ser lie Leu Phe Ser Ala Cys Ser Thr Ala Gly Ser Phe 
530 535 540 

Phe Ala Ala Ala Phe lie Pro Val Pro Ala Leu Lys Val Phe Cys Leu 

14 



545 



550 



555 



560 



Gin Ala Ala lie Val Met Cys Ser Asn Leu Ala Ala Ala Leu Leu Val 

565 570 575 

Phe Pro Ala Met lie Ser Leu Asp Leu Arg Arg Arg Thr Ala Gly Arg 
580 585 590 

Ala Asp lie Phe Cys Cys Cys Phe Pro Val Trp Lys Glu Gin Pro Lys 
595 600 605 

Val Ala Pro Pro Val Leu Pro Leu Asn Asn Asn Asn Gly Arg Gly Ala 
610 615 620 

Arg His Pro Lys Ser Cys Asn Asn Asn Arg Val Pro Leu Pro Ala Gin 
625 630 635 640 

Asn Pro Leu Leu Glu Gin Arg Ala Asp lie Pro Gly Ser Ser His Ser 

645 650 655 

Leu Ala Ser Phe Ser Leu Ala Thr Phe Ala Phe Gin His Tyr Thr Pro 
660 665 670 

Phe Leu Met Arg Ser Trp Val Lys Phe Leu Thr Val Met Gly Phe Leu 
675 680 685 

Ala Ala Leu lie Ser Ser Leu Tyr Ala Ser Thr Arg Leu Gin Asp Gly 
690 695 700 

Leu Asp lie lie Asp Leu Val Pro Lys Asp Ser Asn Glu His Lys Phe 
705 710 715 720 

Leu Asp Ala Gin Thr Arg Leu Phe Gly Phe Tyr Ser Met Tyr Ala Val 

725 730 735 

Thr Gin Gly Asn Phe Glu Tyr Pro Thr Gin Gin Gin Leu Leu Arg Asp 
740 745 750 

Tyr His Asp Ser Phe Val Arg Val Pro His Val lie Lys Asn Asp Asn 
755 760 765 

Gly Gly Leu Pro Asp Phe Trp Leu Leu Leu Phe Ser Glu Trp Leu Gly 
770 775 780 

Asn Leu Gin Lys lie Phe Asp Glu Glu Tyr Arg Asp Gly Arg Leu Thr 
785 790 795 800 

Lys Glu Cys Trp Phe Pro Asn Ala Ser Ser Asp Ala lie Leu Ala Tyr 

805 810 815 

Lys Leu lie Val Gin Thr Gly His Val Asp Asn Pro Val Asp Lys Glu 
820 825 830 

Leu Val Leu Thr Asn Arg Leu Val Asn Ser Asp Gly lie lie Asn Gin 
835 840 845 

Arg Ala Phe Tyr Asn Tyr Leu Ser Ala Trp Ala Thr Asn Asp Val Phe 
850 855 860 
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Ala Tyr Gly Ala Ser Gin Gly Lys Leu Tyr Pro Glu Pro Arg Gin Tyr 
865 870 875 880 

Phe His Gin Pro Asn Glu Tyr Asp Leu Lys lie Pro Lys Ser Leu Pro 

885 890 895 

Leu Val Tyr Ala Gin Met Pro Phe Tyr Leu His Gly Leu Thr Asp Thr 
900 905 910 

Ser Gin lie Lys Thr Leu lie Gly His lie Arg Asp Leu Ser Val Lys 
915 920 925 

Tyr Glu Gly Phe Gly Leu Pro Asn Tyr Pro Ser Gly lie Pro Phe lie 
930 935 940 

Phe Trp Glu Gin Tyr Met Thr Leu Arg Ser Ser Leu Ala Met lie Leu 
945 950 955 960 

Ala Cys Val Leu Leu Ala Ala Leu Val Leu Val Ser Leu Leu Leu. Leu 

965 970 975 

Ser Val Trp Ala Ala Val Leu Val lie Leu Ser Val Leu Ala Ser Leu 
980 985 990 

Ala Gin lie Phe Gly Ala Met Thr Leu Leu Gly lie Lys Leu Ser Ala 
995 1000 1005 

lie Pro Ala Val lie Leu lie Leu Ser Val Gly Met Met Leu Cys Phe 
1010 1015 1020 

Asn Val Leu lie Ser Leu Gly Phe Met Thr Ser Val Gly Asn Arg Gin 
1025 1030 1035 1040 

Arg Arg Val Gin Leu Ser Met Gin Met Ser Leu Gly Pro Leu Val His 

1045 1050 1055 

Gly Met Leu Thr Ser Gly Val Ala Val Phe Met Leu Ser Thr Ser Pro 
1060 1065 1070 

Phe Glu Phe Val lie Arg His Phe Cys Trp Leu Leu Leu Val Val Leu 
1075 1080 1085 

Cys Val Gly Ala Cys Asn Ser Leu Leu Val Phe Pro lie Leu Leu Ser 
1090 1095 1100 

Met Val Gly Pro Glu Ala Glu Leu Val Pro Leu Glu His Pro Asp Arg 
1105 1110 1115 1120 

lie Ser Thr Pro Ser Pro Leu Pro Val Arg Ser Ser Lys Arg Ser Gly 

1125 1130 1135 

Lys Ser Tyr Val Val Gin Gly Ser Arg Ser Ser Arg Gly Ser Cys Gin 
1140 1145 1150 

Lys Ser His His His His His Lys Asp Leu Asn Asp Pro Ser Leu Thr 
1155 1160 1165 

Thr lie Thr Glu Glu Pro Gin Ser Trp Lys Ser Ser Asn Ser Ser lie 
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1170 1175 1180 

Gin Met Pro Asn Asp Trp Thr Tyr Gin Pro Arg Glu Gin Arg Pro Ala 
1185 1190 1195 1200 



Ser Tyr Ala Ala Pro Pro Pro Ala Tyr His Lys Ala Ala Ala Gin Gin 

1205 1210 1215 

His His Gin His Gin Gly Pro Pro Thr Thr Pro Pro Pro Pro Phe Pro 
1220 1225 1230 

Thr Ala Tyr Pro Pro Glu Leu Gin Ser lie Val Val Gin Pro Glu Val 
1235 1240 1245 

Thr Val Glu Thr Thr His Ser Asp Ser Asn Thr Thr Lys Val Thr Ala 
1250 1255 1260 

Thr Ala Asn lie Lys Val Glu Leu Ala Met Pro Gly Arg Ala Val Arg 
1265 1270 1275 1280 

Ser Tyr Asn Phe Thr Ser 

1285 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 342 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
AAGGTCCATC AGCTTTGGAT ACAGGAAGGT GGTTCGCTCG AGCATGAGCT AGCCTACACG 60 



CAGAAATCGC TCGGCGAGAT GGACTCCTCC ACGCACCAGC TGCTAATCCA AACCCCAAAG 12 0 

ATATGGACGC CTCGATACTG CACCCGAACG CGCTACTGAC GCACCTGGAC GTGGTGAAGA 180 

AAGCGATCTC GGTGACGGTG CACATGTACG ACATCACGTG G AG CT C AAGG ACATGTGCTA 2 4 0 

CTCGCCCAGC ATACCGAGTT CGATACGCAC TTTATCGAGC AGATCTTCGA G AA CAT CAT A 300 



CCGTGCGCGA TCATCACGCC GCTGGATTGC TTTTGGGAGG GA 342 
(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 115 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Lys Val His Gin Leu Trp lie Gin Glu Gly Gly Ser Leu Glu His Glu 
1 5 10 15 



Leu Ala Tyr Thr Gin Lys Ser Leu 
20 

Gin Leu Leu lie Gin Thr Pro Lys 

35 40 

Pro Asn Ala Leu Leu Thr His Leu 
50 55 

Val Thr Val His Met Tyr Asp lie 
65 70 

Tyr Ser Pro Ser lie Pro Xaa Phe 

85 

Phe Glu Asn lie lie Pro Cys Ala 
100 



Gly Glu Met Asp Ser Ser Thr His 
25 30 

Asp Met Asp Ala Ser lie Leu His 

45 

Asp Val Val Lys Lys Ala lie Ser 
60 

Thr Trp Xaa Leu Lys Asp Met Cys 

75 ' 80 

Asp Thr His Phe lie Glu Gin lie 
90 95 

lie lie Thr Pro Leu Asp Cys Phe 
105 110 



Trp Glu Gly 
115 



(2) INFORMATION FOR SEQ ID NO: 9: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
GGATGTTAAT ATTCTCATCG 
(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 56 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
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Met Ala Ser Ala 
1 

Arg Arg Glu Ala 
20 

Pro Gly Leu Ser 
35 

Glu Gin lie Ser 
50 

Leu Arg Ala Lys 
65 

Gin Lys Asn Cys 



Ala Phe Ala Val 
100 

Glu Leu Trp Val 
115 

Thr Arg Gin Lys 
130 

lie Gin Thr Pro 
145 

Leu Leu Gin His 



Tyr Met Tyr Asn 
180 

Gly Glu Leu lie 
195 

Leu Tyr Pro Cys 
210 

Ala Lys Leu Gin 
225 

Arg Trp Thr Asn 



lie Asn Tyr Gin 
260 

Val Gly His Gly 
275 

Asp Cys Pro Ala 
290 

Val Ala Leu Val 




Gly Asn Ala Arg 

5 

Gin Thr Asp Arg 



Ala Pro Ala Gin 
40 

Lys Gly Lys Ala 
55 

Phe Gin Arg Leu 
70 

Gly Lys Phe Leu 
85 

Gly Leu Lys Ala 



Glu Val Gly Gly 
120 

lie Gly Glu Glu 
135 

Lys Glu Glu Gly 
150 

Leu Asp Ser Ala 
165 

Arg Gin Trp Lys 



Thr Glu Thr Gly 
200 

Leu lie lie Thr 
215 

Ser Gly Thr Ala 
230 

Phe Asp Pro Leu 
245 

Val Asp Ser Trp 



Tyr Met Asp Arg 
280 

Thr Ala Pro Asn 
295 

Leu Asn Gly Gly 




Arg Gly Pro Gly 
10 

Gly Thr Ala Pro 
25 

Leu Leu Arg Arg 



Thr Gly Arg Lys 
60 

Leu Phe Lys Leu 
75 

Val Val Gly Leu 
90 

Ala Asn Leu Glu 
105 

Arg Val Ser Arg 



Ala Met Phe Asn 
140 

Ala Asn Val Leu 
155 

Leu Gin Ala Ser 
170 

Leu Glu His Leu 
185 

Tyr Met Asp Gin 



Pro Leu Asp Cys 
220 

Tyr Leu Leu Gly 
235 

Glu Phe Leu Glu 
250 

Glu Glu Met Leu 
265 

Pro Cys Leu Asn 



Lys Asn Ser Thr 
300 

Cys Gin Gly Leu 



Gin Ala Gly Arg 
15 

Arg Arg Ala Gly 
30 

Ala Phe Ala Leu 
45 

Ala Pro Leu Trp 



Gly Cys Tyr lie 
80 

Leu lie Phe Gly 
95 

Thr Asn Val Glu 
110 

Glu Leu Asn Tyr 
125 

Pro Gin Leu Met 



Thr Thr Glu Ala 
160 

Arg Val His Val 
175 

Cys Tyr Lys Ser 
190 

lie lie Glu Tyr 
205 

Phe Trp Glu Gly 



Lys Pro Pro Leu 
240 

Glu Leu Lys Lys 
255 

Asn Lys Ala Glu 
270 

Pro Ala Asp Pro 

285 

Lys Pro Leu Asp 



Ser Arg Lys Tyr 
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305 310 315 320 

Met His Trp Gin Glu Glu Leu lie Val Gly Gly Thr Val Lys Asn Ala 

325 330 335 

Thr Gly Lys Leu Val Ser Ala His Ala Leu Gin Thr Met Phe Gin Leu 
340 345 350 

Met Thr Pro Lys Gin Met Tyr Glu His Phe Arg Gly Tyr Asp Tyr Val 
355 360 365 

Ser His lie Asn Trp Asn Glu Asp Arg Ala Ala Ala lie Leu Glu Ala 
370 375 380 

Trp Gin Arg Thr Tyr Val Glu Val Val His Gin Ser Val Ala Pro Asn 
385 390 395 400 

Ser Thr Gin Lys Val Leu Pro Phe Thr Thr Thr Thr Leu Asp Asp lie 

405 410 415 

Leu Lys Ser Phe Ser Asp Val Ser Val lie Arg Val Ala Ser Gly Tyr 
420 425 430 

Leu Leu Met Leu Ala Tyr Ala Cys Leu Thr Met Leu Arg Trp Asp Cys 
435 440 445 

Ser Lys Ser Gin Gly Ala Val Gly Leu Ala Gly Val Leu Leu Val Ala 
450 455 460 

Leu Ser Val Ala Ala Gly Leu Gly Leu Cys Ser Leu lie Gly lie Ser 
465 470 475 480 

Phe Asn Ala Ala Thr Thr Gin Val Leu Pro Phe Leu Ala Leu Gly Val 

485 490 495 

Gly Val Asp Asp Val Phe Leu Leu Ala His Ala Phe Ser Glu Thr Gly 
500 505 510 

Gin Asn Lys Arg lie Pro Phe Glu Asp Arg Thr Gly Glu Cys Leu Lys 
515 520 525 

Arg Thr Gly Ala Ser Val Ala Leu Thr Ser lie Ser Asn Val Thr Ala 
530 535 540 

Phe Phe Met Ala Ala Leu lie Pro lie Pro Ala Leu Arg Ala Phe Ser 
545 550 555 560 

Leu Gin Ala Ala Val Val Val Val Phe Asn Phe Ala Met Val Leu Leu 

565 570 575 

lie Phe Pro Ala lie Leu Ser Met Asp Leu Tyr Arg Arg Glu Asp Arg 
580 585 590 

Arg Leu Asp lie Phe Cys Cys Leu Thr Ser Pro Cys Val Ser Arg Val 
595 600 605 

lie Gin Val Glu Pro Gin Ala Tyr Thr Glu Pro His Ser Asn Thr Arg 
610 615 620 
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Tyr Ser Pro Pro Pro Pro Tyr Thr Ser His Ser Phe Ala His Glu Thr 
625 630 635 640 

His lie Thr Met Gin Ser Thr Val Gin Leu Arg Thr Glu Tyr Asp Pro 

645 650 655 

His Thr His Val Tyr Tyr Thr Thr Ala Glu Pro Arg Ser Glu lie Ser 
660 665 670 

Val Gin Pro Val Thr Val Thr Gin Asp Asn Leu Ser Cys Gin Ser Pro 
675 680 685 

Glu Ser Thr Ser Ser Thr Arg Asp Leu Leu Ser Gin Phe Ser Asp Ser 
690 695 700 

Ser Leu His Cys Leu Glu Pro Pro Cys Thr Lys Trp Thr Leu Ser Ser 
705 710 715 720 

Phe Ala Glu Lys His Tyr Ala Pro Phe Leu Leu Lys Pro Lys Ala Lys 

725 730 735 

Val Val Val lie Leu Leu Phe Leu Gly Leu Leu Gly Val Ser Leu Tyr 
740 745 750 

Gly Thr Thr Arg Val Arg Asp Gly Leu Asp Leu Thr Asp lie Val Pro 
755 760 765 

Arg Glu Thr Arg Glu Tyr Asp Phe lie Ala Ala Gin Phe Lys Tyr Phe 
770 775 780 

Ser Phe Tyr Asn Met Tyr lie Val Thr Gin Lys Ala Asp Tyr Pro Asn 
785 790 795 800 

lie Gin His Leu Leu Tyr Asp Leu His Lys Ser Phe Ser Asn Val Lys 

805 810 815 

Tyr Val Met Leu Glu Glu Asn Lys Gin Leu Pro Gin Met Trp Leu His 
820 825 830 

Tyr Phe Arg Asp Trp Leu Gin Gly Leu Gin Asp Ala Phe Asp Ser Asp 
835 840 845 

Trp Glu Thr Gly Arg lie Met Pro Asn Asn Tyr Lys Asn Gly Ser Asp 
850 855 860 

Asp Gly Val Leu Ala Tyr Lys Leu Leu Val Gin Thr Gly Ser Arg Asp 
865 870 875 880 

Lys Pro lie Asp lie Ser Gin Leu Thr Lys Gin Arg Leu Val Asp Ala 

885 890 895 

Asp Gly lie lie Asn Pro Ser Ala Phe Tyr lie Tyr Leu Thr Ala Trp 
900 905 910 

Val Ser Asn Asp Pro Val Ala Tyr Ala Ala Ser Gin Ala Asn lie Arg 
915 920 925 

Pro His Arg Pro Glu Trp Val His Asp Lys Ala Asp Tyr Met Pro Glu 
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930 935 940 

Thr Arg Leu Arg lie Pro Ala Ala Glu Pro lie Glu Tyr Ala Gin Phe 
945 950 955 960 

Pro Phe Tyr Leu Asn Gly Leu Arg Asp Thr Ser Asp Phe Val Glu Ala 

965 970 975 

lie Glu Lys Val Arg Val lie Cys Asn Asn Tyr Thr Ser Leu Gly Leu 
980 985 990 

Ser Ser Tyr Pro Asn Gly Tyr Pro Phe Leu Phe Trp Glu Gin Tyr lie 
995 1000 1005 

Ser Leu Arg His Trp Leu Leu Leu Ser lie Ser Val Val Leu Ala Cys 
1010 1015 1020 

Thr Phe Leu Val Cys Ala Val Phe Leu Leu Asn Pro Trp Thr Ala Gly 
1025 1030 1035 1040 

lie lie Val Met Val Leu Ala Leu Met Thr Val Glu Leu Phe Gly Met 

1045 1050 1055 

Met Gly Leu lie Gly lie Lys Leu Ser Ala Val Pro Val Val lie Leu 
1060 1065 1070 

lie Ala Ser Val Gly lie Gly Val Glu Phe Thr Val His Val Ala Leu 
1075 1080 1085 

Ala Phe Leu Thr Ala lie Gly Asp Lys Asn His Arg Ala Met Leu Ala 
1090 1095 1100 

Leu Glu His Met Phe Ala Pro Val Leu Asp Gly Ala Val Ser Thr Leu 
1105 1110 1115 1120 

Leu Gly Val Leu Met Leu Ala Gly Ser Glu Phe Asp Phe lie Val Arg 

1125 1130 1135 

Tyr Phe Phe Ala Val Leu Ala lie Leu Thr Val Leu Gly Val Leu Asn 
1140 1145 1150 

Gly Leu Val Leu Leu Pro Val Leu Leu Ser Phe Phe Gly Pro Cys Pro 
1155 1160 1165 

Glu Val Ser Pro Ala Asn Gly Leu Asn Arg Leu Pro Thr Pro Ser Pro 
1170 1175 1180 

Glu Pro Pro Pro Ser Val Val Arg Phe Ala Val Pro Pro Gly His Thr 
1185 1190 1195 1200 

Asn Asn Gly Ser Asp Ser Ser Asp Ser Glu Tyr Ser Ser Gin Thr Thr 

1205 1210 1215 

Val Ser Gly lie Ser Glu Glu Leu Arg Gin Tyr Glu Ala Gin Gin Gly 
1220 1225 1230 

Ala Gly Gly Pro Ala His Gin Val lie Val Glu Ala Thr Glu Asn Pro 
1235 1240 1245 
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Val Phe Ala Arg Ser Thr Val Val His Pro Asp Ser Arg His Gin Pro 
1250 1255 1260 

Pro Leu Thr Pro Arg Gin Gin Pro His Leu Asp Ser Gly Ser Leu Ser 
1265 1270 1275 1280 

Pro Gly Arg Gin Gly Gin Gin Pro Arg Arg Asp Pro Pro Arg Glu Gly 

1285 1290 1295 

Leu Arg Pro Pro Pro Tyr Arg Pro Arg Arg Asp Ala Phe Glu lie Ser 
1300 1305 1310 

Thr Glu Gly His Ser Gly Pro Ser Asn Arg Asp Arg Ser Gly Pro Val 
1315 1320 1325 

Gly Pro Val Leu Thr Thr Leu Gly Thr Gin Arg Pro Pro Pro Trp Ala 
1330 . 1335 1340 

Ala Leu Cys Pro Ala Thr Ala Ser Pro Ser Pro Leu 
1345 1350 1355 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 56 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

Met Ala Ser Ala Gly Asn Ala Arg Arg Gly Pro Gly Gin Ala Gly Arg 
1 5 10 15 

Arg Arg Glu Ala Gin Thr Asp Arg Gly Thr Ala Pro Arg Arg Ala Gly 
20 25 30 

Pro Gly Leu Ser Ala Pro Ala Gin Leu Leu Arg Arg Ala Phe Ala Leu 
35 40 45 

Glu Gin lie Ser Lys Gly Lys Ala Thr Gly Arg Lys Ala Pro Leu Trp 
50 55 60 

Leu Arg Ala Lys Phe Gin Arg Leu Leu Phe Lys Leu Gly Cys Tyr lie 
,65 70 75 80 

Gin Lys Asn Cys Gly Lys Phe Leu Val Val Gly Leu Leu lie Phe Gly 

85 90 95 

Ala Phe Ala Val Gly Leu Lys Ala Ala Asn Leu Glu Thr Asn Val Glu 
100 105 " 110 

Glu Leu Trp Val Glu Val Gly Gly Arg Val Ser Arg Glu Leu Asn Tyr 
115 120 125 



23 



Thr Arg Gin Lys lie Gly Glu Glu Ala Met Phe Asn Pro Gin Leu Met 
130 135 140 

lie Gin Thr Pro Lys Glu Glu Gly Ala Asn Val Leu Thr Thr Glu Ala 
145 150 155 160 

Leu Leu Gin His Leu Asp Ser Ala Leu Gin Ala Ser Arg Val His Val 

165 170 175 

Tyr Met Tyr Asn Arg Gin Trp Lys Leu Glu His Leu Cys Tyr Lys Ser 
180 185 190 

Gly Glu Leu lie Thr Glu Thr Gly Tyr Met Asp Gin lie lie Glu Tyr 
195 200 205 

Leu Tyr Pro Cys Leu lie lie Thr Pro Leu Asp Cys Phe Trp Glu Gly 
210 215 220 

Ala Lys Leu Gin Ser Gly Thr Ala Tyr Leu Leu Gly Lys Pro Pro Leu 
225 230 235 240 

Arg Trp Thr Asn Phe Asp Pro Leu Glu Phe Leu Glu Glu Leu Lys Lys 

245 250 255 

lie Asn Tyr Gin Val Asp Ser Trp Glu Glu Met Leu Asn Lys Ala Glu 
260 265 270 

Val Gly His Gly Tyr Met Asp Arg Pro Cys Leu Asn Pro Ala Asp Pro 
275 280 285 

Asp Cys Pro Ala Thr Ala Pro Asn Lys Asn Ser Thr Lys Pro Leu Asp 
290 295 300 

Val Ala Leu Val. Leu Asn Gly Gly Cys Gin Gly Leu Ser Arg Lys Tyr 
305 310 315 320 

Met His Trp Gin Glu Glu Leu lie Val Gly Gly Thr Val Lys Asn Ala 

325 330 335 

Thr Gly Lys Leu Val Ser Ala His Ala Leu Gin Thr Met Phe Gin Leu 
340 345 350 

Met Thr Pro Lys Gin Met Tyr Glu His Phe Arg Gly Tyr Asp Tyr Val 
355 360 365 

Ser His lie Asn Trp Asn Glu Asp Arg Ala Ala Ala lie Leu Glu Ala 
370 375 380 

Trp Gin Arg Thr Tyr Val Glu Val Val His Gin Ser Val Ala Pro Asn 
385 390 395 400 

Ser Thr Gin Lys Val Leu Pro Phe Thr Thr Thr Thr Leu Asp Asp lie 

405 410 415 

Leu Lys Ser Phe Ser Asp Val Ser Val lie Arg Val Ala Ser Gly Tyr 
420 425 430 

Leu Leu Met Leu Ala Tyr Ala Cys Leu Thr Met Leu Arg Trp Asp Cys 



24 



Ser Lys Ser Gin 
450 

Leu Ser Val Ala 
465 

Phe Asn Ala Ala 



Gly Val Asp Asp 
500 

Gin Asn Lys Arg 
515 

Arg Thr Gly Ala 
530 

Phe Phe Met Ala 
545 

Leu Gin Ala Ala 



lie Phe Pro Ala 
580 

Arg Leu Asp lie 
595 

lie Gin Val Glu 
610 

Tyr Ser Pro Pro 
625 

His lie Thr Met 



His Thr His Val 
660 

Val Gin Pro Val 
675 

Glu Ser Thr Ser 
690 

Ser Leu His Cys 
705 

Phe Ala Glu Lys 



Val Val Val lie 
740 




440 

Gly Ala Val Gly 
455 

Ala Gly Leu Gly 
470 

Thr Thr Gin Val 
485 

Val Phe Leu Leu 



lie Pro Phe Glu 
520 

Ser Val Ala Leu 

535 

Ala Leu lie Pro 
550 

Val Val Val Val 
565 

lie Leu Ser Met 



Phe Cys Cys Leu 
600 

Pro Gin Ala Tyr 
615 

Pro Pro Tyr Thr 
630 

Gin Ser Thr Val 
645 

Tyr Tyr Thr Thr 



Thr Val Thr Gin 
680 

Ser Thr Arg Asp 
695 

Leu Glu Pro Pro 
710 

His Tyr Ala Pro 
725 

Leu Leu Phe Leu 




Leu Ala Gly Val 
460 

Leu Cys Ser Leu 
475 

Leu Pro Phe Leu 
490 

Ala His Ala Phe 
505 

Asp Arg Thr Gly 



Thr Ser lie Ser 
540 

lie Pro Ala Leu 
555 

Phe Asn Phe Ala 
570 

Asp Leu Tyr Arg 
585 

Thr Ser Pro Cys 



Thr Glu Pro His 
620 

Ser His Ser Phe 
635 

Gin Leu Arg Thr 
650 

Ala Glu Pro Arg 
665 

Asp Asn Leu Ser 



Leu Leu Ser Gin 
700 

Cys Thr Lys Trp 
715 

Phe Leu Leu Lys 
730 

Gly Leu Leu Gly 
745 



Leu Leu Val Ala 



lie Gly lie Ser 
480 

Ala Leu Gly Val 
495 

Ser Glu Thr Gly 
510 

Glu Cys Leu Lys 
525 

Asn Val Thr Ala 



Arg Ala Phe Ser 
560 

Met Val Leu Leu 
575 

Arg Glu Asp Arg 
590 

Val Ser Arg Val 
605 

Ser Asn Thr Arg 



Ala His Glu Thr 
640 

Glu Tyr Asp Pro 
655 

Ser Glu lie Ser 
670 

Cys Gin Ser Pro 
685 

Phe Ser Asp Ser 



Thr Leu Ser Ser 
720 

Pro Lys Ala Lys 
735 

Val Ser Leu Tyr 
750 
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Gly Thr Thr Arg Val Arg Asp Gly Leu Asp Leu Thr Asp lie Val Pro 
755 760 765 

Arg Glu Thr Arg Glu Tyr Asp Phe lie Ala Ala Gin Phe Lys Tyr Phe 
770 775 780 

Ser Phe Tyr Asn Met Tyr lie Val Thr Gin Lys Ala Asp Tyr Pro Asn 
785 790 795 800 

lie Gin His Leu Leu Tyr Asp Leu His Lys Ser Phe Ser Asn Val Lys 

805 810 815 

Tyr Val Met Leu Glu Glu Asn Lys Gin Leu Pro Gin Met Trp Leu His 
820 825 830 

Tyr Phe Arg Asp Trp Leu Gin Gly Leu Gin Asp Ala Phe Asp Ser Asp 
835 840 845 

Trp Glu Thr Gly Arg lie Met Pro Asn Asn Tyr Lys Asn Gly Ser Asp 
850 855 860 

Asp Gly Val Leu Ala Tyr Lys Leu Leu Val Gin Thr Gly Ser Arg Asp 
865 870 875 880 

Lys Pro lie Asp lie Ser Gin Leu Thr Lys Gin Arg Leu Val Asp Ala 

885 890 895 

Asp Gly lie lie Asn Pro Ser Ala Phe Tyr lie Tyr Leu Thr Ala Trp 
900 905 910 

Val Ser Asn Asp Pro Val Ala Tyr Ala Ala Ser Gin Ala Asn lie Arg 
915 920 925 

Pro His Arg Pro Glu Trp Val His Asp Lys Ala Asp Tyr Met Pro Glu 
930 935 940 

Thr Arg Leu Arg lie Pro Ala Ala Glu Pro lie Glu Tyr Ala Gin Phe 
945 950 955 960 

Pro Phe Tyr Leu Asn Gly Leu Arg Asp Thr Ser Asp Phe Val Glu Ala 

965 970 975 

lie Glu Lys Val Arg Val lie Cys Asn Asn Tyr Thr Ser Leu Gly Leu 
980 985 990 

Ser Ser Tyr Pro Asn Gly Tyr Pro Phe Leu Phe Trp Glu Gin Tyr lie 
995 1000 1005 

Ser Leu Arg His Trp Leu Leu Leu Ser lie Ser Val Val Leu Ala Cys 
1010 1015 1020 

Thr Phe Leu Val Cys Ala Val Phe Leu Leu Asn Pro Trp Thr Ala Gly 
1025 1030 1035 1040 

lie lie Val Met Val Leu Ala Leu Met Thr Val Glu Leu Phe Gly Met 

1045 1050 1055 

Met Gly Leu lie Gly lie Lys Leu Ser Ala Val Pro Val Val lie Leu 
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1060 1065 1070 

lie Ala Ser Val Gly lie Gly Val Glu Phe Thr Val His Val Ala Leu 
1075 1080 1085 

Ala Phe Leu Thr Ala lie Gly Asp Lys Asn His Arg Ala Met Leu Ala 
1090 1095 1100 

Leu Glu His Met Phe Ala Pro Val Leu Asp Gly Ala Val Ser Thr Leu 
1105 1110 1115 1120 

Leu Gly Val Leu Met Leu Ala Gly Ser Glu Phe Asp Phe lie Val Arg 

1125 1130 1135 

Tyr Phe Phe Ala Val Leu Ala lie Leu Thr Val Leu Gly Val Leu Asn 
1140 1145 1150 

Gly Leu Val Leu Leu Pro Val Leu Leu Ser Phe Phe Gly Pro Cys Pro 
1155 1160 1165 

Glu Val Ser Pro Ala Asn Gly Leu Asn Arg Leu Pro Thr Pro Ser Pro 
1170 1175 1180 

Glu Pro Pro Pro Ser Val Val Arg Phe Ala Val Pro Pro Gly His Thr 
1185 1190 1195 1200 

Asn Asn Gly Ser Asp Ser Ser Asp Ser Glu Tyr Ser Ser Gin Thr Thr 

1205 1210 1215 

Val Ser Gly lie Ser Glu Glu Leu Arg Gin Tyr Glu Ala Gin Gin Gly 
1220 1225 1230 

Ala Gly Gly Pro Ala His Gin Val lie Val Glu Ala Thr Glu Asn Pro 
1235 1240 1245 

Val Phe Ala Arg Ser Thr Val Val His Pro Asp Ser Arg His Gin Pro 
1250 1255 1260 

Pro Leu Thr Pro Arg Gin Gin Pro His Leu Asp Ser Gly Ser Leu Ser 
1265 1270 1275 1280 

Pro Gly Arg Gin Gly Gin Gin Pro Arg Arg Asp Pro Pro Arg Glu Gly 

1285 1290 1295 

Leu Arg Pro Pro Pro Tyr Arg Pro Arg Arg Asp Ala Phe Glu lie Ser 
1300 1305 1310 

Thr Glu Gly His Ser Gly Pro Ser Asn Arg Asp Arg Ser Gly Pro Val 
1315 1320 1325 

Gly Pro Val Leu Thr Thr Leu Gly Thr Gin Arg Pro Pro Pro Trp Ala 
1330 1335 1340 

Ala Leu Cys Pro Ala Thr Ala Ser Pro Ser Pro Leu 
1345 1350 1355 

(2) INFORMATION FOR SEQ ID NO: 12: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9042 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

GATCCTGAAT TGAGAAATAT AGATTGAAAC AGAATTCATT ACCATTTAAG CAATCATTAT 60 

TTATGGGGGC GTAATGCGCC TCCGAGTAGG CAATGCTTTT CTTGACATTG TTACTAAGAA 12 0 

TTGTGAATGA TATTTGGGCG TGGATCAACG CCGATTAAAA GCTGCTTTTG CTTCCAGGCG 18 0 

GCCAGAGAAG AGATCCAAAC TTCAACTCCA GCCATAAAAG CAACAACATT TCCGTCTCCC 240 

CCTTGTAGCT CCCCTTCCTC CGGCTCTTCC ACTCTCCACG AAACGGCAAA TGAAGCTCTC 3 00 

AAAGCGAACT GTGCTTCGCT GGTGGTCCAT TGGCAGCTGC CGCCACACAG GCGCTGCTTT 3 60 

TGTGTGTGTG TGTAATATCA ATCTTGCTCT CCCTCTCTTT TTATCTCTCT TGGGAATTGG 42 0 

AGCTGCATGC GAATTGAGCG ACAGCAAAAC GAACTGCAAG T C ATTG AG AG GAG AG C AAAA 480 

ACTCGAGCGC AAGCCAAAGA TAGCGCAATC TGGGGAGAGC GAAATAAAGC TAAAATATGC 54 0 

ATGTTGGAGA AAAAATGCCG CCCATGTCGC CAAAATGCGC CACACGCAGA GTGAGCGGGC 6 00 

GGAGGTGGGA GTAATGGAAA GGGCGATGAG GGAACGATTA GCTTGAAGAG AGAGAACAAC 660 

AAATGAATGT GCTGCAACGT TAGTTCAGGT GAGCGCGTTA GAGAGAGAGT TGTTGTTTTT 72 0 

TGATTGTAAT AGCTCGCTTG GTGGTGGGTC CACATTCACA TCTCCCTCTC CCACTCTTTC 78 0 

TCCCCGAAAG AGAGAGCGGG AGCGAAGGGG CACGAGGGGA GCACGATGAC TATGCAGTTG 84 0 

CATTCAATTT GAATTTCCAT GGTGCTGATG ATTCGAGCGC CAATTTTTTC GAAGAGTTCT 9 00 

TATTTGTTTA CTTCGTTGTT GTTGCCTCAA TTGGAAAGGG AAAATGTGGA ATGCGGAGAA 9 60 

ACACCAGAAG CAAATGCATT TCCATTCATA AATCCAAAGA AGTTTTAAAG ATAACATGTC 102 0 

ATTTGGCTTA AGTTCGTGGT GCACAAAAAA GATCGGTTTG CGGTTGTCGC AT G AAATG AG 108 0 

TTTATTCCAT TGGTATATTA TTATTCAGAA ATTAAAAAAA AACTTGTTTA GTCTATTTTT 114 0 

TTTTTTTAAA TAAAAAAAAA AAATTCTTTT ATAAGTCGAT TTTAGAGTAA ATATTTAAAG 12 00 

ACTACGTCTA ATAAACATAT AATTTGTTCT GTGTTTTAAT TTGCCGGCAA AAACAAACCT 12 60 

ACTTGTGTGG TCCTCGCACA CTCATAACCC CTCGCATATT TGAGATTCAT GGGGCAAGAG 13 2 0 

GCTGCAAAAA CAATGGAAAG GGAAAAGCAG AAACATCCTG CCGCTCATAA TTTAGCATCG 13 8 0 
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GAACATGCAA AAACAGACAT CATCGCATGG GGCAGCAGCA ACAGCCATAA AACCAACACG 14 4 0 

AGCAATGTAA AGCTAACAAA TTTGCCAACA GTTCGCGGCA CGGCTACACA CACACACATG 1500 

CATGCGCAGC CTGCCACGCA CGCGCTTCCC CCAAACAAAT ACACACACAC ACATCTGAGAC 1560 

GAAAGCTCCA TTGGGCAGCG CTGCCGACGC TGAAGGCCGA CATCGGCAGA GCTGAACGTT 162 0 

^GGGTAGGGG ACCACCCACA TCGCTTGGCG GTTTCAGTTT AATGAAGGCA GAAACAAATT 168 0 

TATTTTTGGG TGGTCCACAC TGCAGCGAAA ATAAACTACA GTGGCAACAA CAAACCAGCA 174 0 

GCCAAGGCAC TTTGGGTGGT CCATGCAAAA AAAAAACAAA TTACGGCATG CGAATAACAA 18 0 0 

TAGAAATTAG CGCTCTCGTG GCGGAGCTAT TTGGGTATAT TAGAGCTACA TATTTTATTT 1860 

GTTTATAAAA AGTATAAATG TAAACAATGA GTTCCAAGCA TTAAGTCCGT ATGCTCAACA 192 0 

ATTACATTAT CATTATTATT ATCACTTAAA TATTTACAAA GGATATTTAA ACAGTAATAG 198 0 

ATATATATTT TATTTCTTAA TTTCTGTTAA CATATGTATT TACATTGGTA GTTATTCTTT 2 04 0 

ATTTTGCAAC AAGCATTCAT AAATTTTATA TAACAAACTT GGTATTTTCT CGGAAAAACT 210 0 

CCTGAATCAC CCCTCGGTAT TTTGTGCGTT GAGCTATCGT TAAAGCAGCC CTCGCAGAGA 2160 

GCGTTCTCAA ACCAAAATGG CCGCACACGA AACAAGAGAG CGAGTGAGAG TAGGGAGAGC 222 0 

GTCTGTGTTG TGTGTTGAGT GTCGCCCACG CACACAGGCG CAAAACAGTG CACACAGACG 2 28 0 

CCCGCTGGGC AAGAGAGAGT GAGAGAGAGA AACAGCGGCG CGCGCTCGCC TAATGAAGTT 2 34 0 

GTTGGCCTGG CTGGCGTGCC GCATCCACGA GATACAGATA CATCTCTCAG ACTGCGTGCG 2 4 00 

ATCCTCGAAC GAAACGGTTG TAAGTGCGGA GCGCGACGAC TTGTTATTCG TATTTCCGAC 24 60 

TACTGGCACT CTCTGTGTGT GGTATACTAA CAAGATAGAT ATCACAGAAC TCGTGGAAAA 2 52 0 

GCTAAGATAT TGTACCTCAC GGATGCGAGG CGAAGTTCAT GGATTAAATG CCAGGCAACA 2 58 0 

ACAAAAGCCA GCCAACCAGC CAGTGTTTGT GTGTGTGCGT CGCCAAGTGC AAAGTAAAGT 2 64 0 

AAAGGTAAAA GAGCGAAAGG CGAGAGAGAA AACCGAATAC GTGAGTCGTC CGACTGCCGC 2 7 00 

TTTTCCATGT GTAAAAGATC TGTGAAAATT CTGTCAAATT CCCCTGAGAA ATTGTGCCCA 27 60 

AG AT AAAAC C CGAAAACCGC GTTTTAATCG TCGAAAAAAC C C AG C AAAAG CGAAGCCAGC 282 0 

AATCACAACA AAACAACATA ACGAGAGCTC AGATACACAG CGTGCTCAGT GAGTGAGCGA 2 88 0 

GAGAGCGCGG GAG AG AG C GT CTCTTGATTT AAAATACAAA ATAATTAAAA ATAAAAATGC 2 94 0 

GGAATGCAGT GCAAAATGCA GCCAAACAAA ATACGAGATT CGAATAACAA TTAATCGAAC 3 0 00 

CGAAAGTCCA CGAACAATCC GCACACTGTC TCCCAAGTCT CAGTTCTCAG GACGCAGACG 3 060 

AACGGCAGGC ACTGTAGAAA GACCGATTCC GCAGCACACT CCCATCTGCA CATCTCCGCC 312 0 
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ACGCGATTCC GTCCGGAATC TGGCTATAAA CATAACCATA ATGGACCGCG ACAGCCTCCC 318 0 

ACGCGTTCCG GACACACACG GCGATGTGGT CGATGAGAAA TTATTCTCGG ATCTTTACAT 3 24 0 

ACGCACCAGC TGGGTGGACG CCCAAGTGGC GCTCGATCAG ATAGATAAGG TGAGTGCCCA 3 3 00 

ACTACAGTGA ACTTTCACTG TGAAGGATAG CCATGTGTTG AATTCAATAA TATTCTTGAT 3 3 60 

CGTATTCGGA GGATCCAATT TTAATGCGTA TTTTATGGCA GTGGAGCAAG GCGGGGGAAT 3 42 0 

CTAAAAAAAA AACTAAACGC TAAATTCCGT ATTTTTGTTG CATTTTTCAG GGCAAAGCGC 3 480 

GTGGCAGCCG CACGGCGATC TAT CT G CG AT CAGTATTCCA GTCCCACCTC GAAACCCTCG 3 54 0 

GCAGCTCCGT GCAAAAGCAC GCGGGCAAGG TGCTATTCGT GGCTATCCTG GTGCTGAGCA 3 600 

CCTTCTGCGT CGGCCTGAAG AGCGCCCAGA TCCACTCCAA GGTGCACCAG CTGTGGATCC 3 660 

AGGAGGGCGG CCGGCTGGAG GCGGAACTGG CCTACACACA GAAGACGATC GGCGAGGACG 37 2 0 

AGTCGGCCAC GCATCAGCTG CTCATTCAGA CGACCCACGA CCCGAACGCC TCCGTCCTGC 3 7 80 

ATCCGCAGGC GCTGCTTGCC CACCTGGAGG TCCTGGTCAA GGCCACCGCC GTCAAGGTGC 3 84 0 

ACCTCTACGA C AC CG AATGG GGGCTGCGCG ACATGTGCAA CATGCCGAGC ACGCCCTCCT 3 900 

TCGAGGGCAT CTACTACATC GAG C AG AT CC TGCGCCACCT CATTCCGTGC TCGATCATCA 3 9 60 

CGCCGCTGGA CTGTTTCTGG GAGGGAAGCC AGCTGTTGGG TCCGGAATCA GCGGTCGTTA 4 02 0 

TACCGTAAGT AGTTAATATG TAGTTAATAG CCACATCTTA TAGATTCTAA AGTGAACGTA 4 08 0 

TCCCTTATGA CCATATCCTT TTGCATGATC TACTTTAACC CACAGTACTT CTCTATTCAT 414 0 

ATTAAGGAAT TAATAAAGTA CTTACTTTGC GCTTACCTTT ATTAAATACG ATAGCTTATC 4 2 00 

TTTATAAACT TGCTATCAAG TCGAAAGATA AACGTGACAA GAGTATCTTT GTACTTATCC 4 2 60 

CAGTTGCTTA CCATCGTAAA TAATCTTCTT ATTAATAAAT ATTCGTAAAT AAATATTCTT 4 3 20 

AACTCAACAA ATCCATCTTT ATTATTGTTA CTCCTCTACA GAGGCCTCAA CCAACGACTC 4 3 80 

CTGTGGACCA CCCTGAATCC CGCCTCTGTG ATGCAGTATA TGAAACAAAA GATGTCCGAG 444 0 

G AAAAG AT C A GCTTCGACTT CGAGACCGTG GAGCAGTACA TGAAGCGTGC GGCCATTGGC 4 500 

AGTGGCTACA TGGAGAAGCC CTGCCTGAAC CCACTGAATC CCAATTGCCC GGACACGGCA 45 60 

CCGAACAAGA ACAGCACCCA GCCGCCGGAT GTGGGAGCCA TCCTGTCCGG AGGCTGCTAC . 4 62 0 

GGTTATGCCG CGAAGCACAT GCACTGGCCG GAGGAGCTGA TTGTGGGCGG ACGGAAGAGG 4 68 0 

AACCGCAGCG GACACTTGAG GAAGGCCCAG GCCCTGCAGT CGGTGGTGCA GCTGATGACC 47 4 0 

G AG AAG G AAA TGTACGACCA GTGGCAGGAC AACTACAAGG TGCACCATCT TGGATGGACG 48 00 

CAGGAGAAGG CAGCGGAGGT TTTGAACGCC TGGCAGCGCA ACTTTTCGCG GGAGGTGGAA 48 60 
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CAGCTGCTAC GTAAACAGTC GAGAATTGCC ACCAACTACG ATATCTACGT GTTCAGCTCG 4 92 0 

GCTGCACTGG ATGACATCCT GGCCAAGTTC TCCCATCCCA GCGCCTTGTC CATTGTCATC 4 98 0 

GGCGTGGCCG TCACCGTTTT GTATGCCTTT TGCACGCTCC TCCGCTGGAG GGACCCCGTC 504 0 

CGTGGCCAGA GCAGTGTGGG CGTGGCCGGA GTTCTGCTCA TGTGCTTCAG TACCGCCGCC 510 0 

^GATTGGGAT TGTCAGCCCT GCTCGGTATC GTTTTCAATG CGCTGACCGC TGCCTATGCG 5160 

GAGAGCAATC GGCGGGAGCA GACCAAGCTG ATTCTCAAGA ACGCCAGCAC CCAGGTGGTT 52 2 0 

CCGTTTTTGG CCCTTGGTCT GGGCGTCGAT CACATCTTCA TAGTGGGACC GAGCATCCTG 52 80 

TTCAGTGCCT GCAGCACCGC AGGATCCTTC TTTGCGGCCG CCTTTATTCC GGTGCCGGCT 53 40 

TTGAAGGTAT TCTGTCTGCA GGCTGCCATC GTAATGTGCT CCAATTTGGC AGCGGCTCTA 54 00 

TTGGTTTTTC CGGCCATGAT TTCGTTGGAT CTACGGAGAC GTACCGCCGG CAGGGCGGAC 54 60 

ATCTTCTGCT GCTGTTTTCC GGTGTGGAAG GAACAGCCGA AGGTGGCACC TCCGGTGCTG 552 0 

CCGCTGAACA ACAACAACGG GCGCGGGGCC CGGCATCCGA AGAGCTGCAA CAACAACAGG 558 0 

GTGCCGCTGC CCGCCCAGAA TCCTCTGCTG GAACAGAGGG CAGACATCCC TGGGAGCAGT 564 0 

CACTCACTGG CGTCCTTCTC CCTGGCAACC TTCGCCTTTC AGCACTACAC TCCCTTCCTC 57 0 0 

ATGCGCAGCT GGGTGAAGTT CCTGACCGTT ATGGGTTTCC TGGCGGCCCT CATATCCAGC 57 6 0 

TTGTATGCCT CCACGCGCCT TCAGGATGGC CTGGACATTA TTGATCTGGT GCCCAAGGAC 58 2 0 

AGCAACGAGC ACAAGTTCCT GGATGCTCAA ACTCGGCTCT TTGGCTTCTA CAGCATGTAT 58 8 0 

GCGGTTACCC AGGGCAACTT TGAATATCCC ACCCAGCAGC AGTTGCTCAG GGACTACCAT 59 4 0 

GATTCCTTTG TGCGGGTGCC AC ATGTG AT C AAGAATGATA ACGGTGGACT GCCGGACTTC 60 00 

TGGCTGCTGC TCTTCAGCGA GTGGCTGGGT AATCTGCAAA AAGATATTCG ACGAGGAATA 60 60 

CCGCGACGGA CGGCTGGACC AAGGAGTGCT GGTTCCCAAA CGCCAGCAGC GATCGCCATC 612 0 

CTGGCCTACA AGCTAATCGT GCAAACCGGC CATGTGGACA ACCCCGTGGA CAAGGAACTG 618 0 

GTGCTCACCA ATCGCCTGGT CAACAGCGAT GGCATCATCA ACCAACGCGC CTTCTACAAC 62 4 0 

TATCTGTCGG CATGGGCCAC CAACGACGTC TTCGCCTACG GAGCTTCTCA GGTGGGTCTT 63 00 

CTTATTAAAT TAAATTAAAT TAAATTAAAT TAGATCGCCT TAGTTCTCCT CATATGTACA 63 6 0 

TACATATTAT AACTTATCGC ACTCCAAAGT TAAAGATTAC TAAATGTGTG TGTATCTTTA 64 2 0 

TTCTTACAGG GCAAATTGTA TCCGGAACCG CGCCAGTATT TTCACCAACC CAACGAGTAC 64 8 0 

GATCTTAAGA TACCCAAGAG TCTGCCATTG GTCTACGCTC AGATGCCCTT TTACCTCCAC 654 0 

GGACTAACAG ATACCTCGCA GATCAAGACC CTGATAGGTC ATATTCGCGA CCTGAGCGTC 6600 
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AAGTACGAGG GCTTCGGCCT GCCCAACTAT CCATCGGGTG AGTCGGAAAT GAGTACTTCA 6 660 

TACATGGGGC CCAACTAACA GTCGATTTAT TT AT C G C C AG GCATTCCCTT CATCTTCTGG 67 2 0 

GAGCAGTACA TGACCCTGCG CTCCTCACTG GCCATGATCC TGGCCTGCGT GCTACTCGCC 67 80 

GCCCTGGTGC TGGTCTCCGT GCTCCTGCTC TCCGTTTGGG CCGCCGTTCT CGTGATCCTC 684 0 

-AGCGTTCTGG CCTCGCTGGC CCAGATCTTT GGGGCCATGA CTCTGCTGGG CATCAAACTC 6900 

TCGGCCATTC CGGCAGTCAT ACTCATCCTC AGCGTGGGCA TGATGCTGTG CTTCAATGTG 69 60 

CTGATATCAC TGGTGAGTCT TCATTTCTGG CTGGACCATT AAGAGCTTCG GAGTGAGTCT 7 02 0 

TCATTTCTGG CTGGACCATT AAGAGCTTCG GAGTGAGTCT TCATTTCTGG CTGGACCATT 7 080 

AAGAGCTTCG GATTTTCCAG AGATATCCCA AGACTTTTCA TTGGATCCTC TTCAGCACAC 714 0 

ATTAATTGCT TATCTTTCCG ATTCTAGGGC TTCATGACAT CCGTTGGCAA CCGACAGCGC 72 0 0 

CGCGTCCAGC TG AG CAT G C A GATGTCCCTG GGACCACTTG TCCACGGCAT GCTGACCTCC 7 2 60 

GGAGTGGCCG TGTTCATGCT CTCCACGTCG CCCTTTGAGT TTGTGATCCG GCACTTCTGC 7 320 

TGGCTTCTGC TGGTGGTCTT ATGCGTTGGC GCCTGCAACA GCCTTTTGGT GTTCCCCATC 7 3 80 

CTACTGAGCA TGGTGGGACC GGAGGCGGAG CTGGTGCCGC TGGAGCATCC AGACCGCATA 7 44 0 

TCCACGCCCT CTCCGCTGCC CGTGCGCAGC AG C AAGAG AT CGGGCAAATC CTATGTGGTG 7 500 

CAGGGATCGC GATCCTCGCG AGGCAGCTGC CAGAAGTCGC ATCACCACCA CCACAAAGAC 7 56 0 

CTTAATGATC CATCGCTGAC GACGATCACC GAGGAGCCGC AGTCGTGGAA GTCCAGCAAC 7 62 0 

TCGTCCATCC AGATGCCCAA TGATTGGACC TACCAGCCGC GGGAACAGCG ACCCGCCTCC 7 68 0 

TACGCGGCCC CGCCCCCCGC CTATCACAAG GCCGCCGCCC AGCAGCACCA C C AG C AT C AG 77 4 0 

GGCCCGCCCA CAACGCCCCC GCCTCCCTTC CCGACGGCCT ATCCGCCGGA GCTGCAGAGC 7 8 00 

ATCGTGGTGC AGCCGGAGGT GACGGTGGAG ACGACGCACT CGGACAGCAA CACCACCAAG 7860 

GTGACGGCCA CGGCCAACAT CAAGGTGGAG CTGGCCATCC CGGCAGGCGG TGCGCAGCTA 7 92 0 

TAACTTTACG AGTT AG C ACT AGCACTAGTT CCTGTAGCTA TTAGGACGTA TCTTTAGACT 7 98 0 

CTAGCCTAAG CCGTAACCCT ATTTGTATCT GTAAAATCGA TTTGTCCAGC GGGTCTGCTG 8 04 0 

AGGATTTCGT TCTCATGGAT TCTCATGGAT TCTCATGGAT GCTTAAATGG CATGGTAATT 810 0 

GGCAAAATAT CAATTTTTGT GTCTCAAAAA GATGCATTAG CTTATGGTTT CAAGATACAT 816 0 

TTTTAAAGAG T C CG C C AG AT ATTTATATAA AAAAAATCCA AAATCGACGT ATCCATGAAA 8 22 0 

ATTG AAAAG C TAAGCAGACC CGTATGTATG TATATGTGTA TGCATGTTAG TTAATTTCCC 8 280 

GAAGTCCGGT ATTTATAGCA GCTGCCTTCC GCGCCCCCCT TCCCTTGAAA TGAACACCCT 8 34 0 
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TCCAGCCACG 


CCCCACCGCC 


CCTCTGCGTA 


GCAGCTTTGT 


ATGTATGTAG 


TATGCTAGCA 


8400 


CCTAAGGAAT 


ACTTAAACTT 


AGAGATATTT 


ATTGTAACAC 


ACGCAAAACA 


CACACAATGT 


8460 


ACTTACATAT 


AATTCAATGC 


GAGATTCACC 


CACACAAAAA 


GGAAACACAA 


CAAACTAGTA 


8520 


ATTGTAGCTC 


GTAATTTAGT 


TTAAATATGT 


TACATAAAAC 


ACAAGGACTT 


GAACCAAAAT 


8580 


-AGTATCGCTT 


AAACGGAAAC 


GAGAGAAACG 


AGAAAAAATA 


ACTATTACTT 


AATCAACTAC 


8640 


AAGAGAGATA 


TCCCTCCTCC 


CCTAACCGTA 


CTTACAACCA 


AAATAAAACA 


AGAGTATAAG 


8700 


CATAAAAATG 


GAAAACGAAG 


CGAGGAACGA 


TTGTAAACGC 


GGTCATTTAT 


CCTGTACATT 


8760 


TGTTGCCCGA 


AGACTGACTG 


TCTTTTTTTT 


AATAAAAATA 


TATATTATAC 


AGTTTTTTAA 


8820 


AAGCGAAATT 


CATGACTTTT 


TTTTAACAGT 


GAGCAGAGAA 


CAAAAGAAAC 


GGAAGTTTTC 


8880 


GCTGTATCAA 


TAAAAAGATT 


CCATTTTTTT 


AATAAATTGT 


AAAAATCCTA 


AAAAAAAGAA 


8940 


GACTACAAAA 


GTTTAAATTT 


TTATACGTTA 


TTGATAAACT 


TTTATACACG 


AAAATACTTG 


9000 


TACTTAGCTA 


TGATCAACTC 


CTTGGCTTAA 


GTCTCGGGTA 


AG 




9042 



(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

lie lie Thr Pro Leu Asp Cys Phe Trp Glu Gly 
15 10 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Leu lie Val Gly Gly 
1 5 
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• 



(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

Pro Phe Phe Trp Glu Gin Tyr 
1 5 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 2 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
GGACGAATTC AAGTCACATT GG 
(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 2 base pairs 

(B) TYPE: nucleic- acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
GGACGAATTC CTCCCAAACA TC 
(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : linear 

(ii) MOLECULE TYPE: cDNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18 
GGACGAATTC TGATGTTTGG GA 
*(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19 
GGACGAATTC TGATGTTTGG GA 
(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20 
CATACCAGCC AAGCTTGTCG GCCATGCAT 



35 



